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ABSTRACT 

This paper presents a multiscale decomposition algo- 
rithm. Unlike standard wavelet transforms, the pro- 
posed operator is both linear and shift invariant. The 
central idea is to obtain shift invariance by averaging 
the aligned wavelet transform projections over all cir- 
cular shifts of the signal. It is shown how the same 
transform can be obtained by a linear filter bank. 



1. INTRODUCTION 

Images typically contain information at many different 
scales. For applications such as image retrieval, it is 
therefore highly desirable to decompose images orthog- 
onally into subbands. This can be done using wavelet 
transforms ||. 

However, the standard wavelet transforms are shift 
variant, i.e. a small shift of the signal can change the 
wavelet transform coefficients quite dramatically. As 
a result, those coefficients cannot be used for image 
retrieval when shifts of the original image can occur. 

Several schemes have been proposed recently to in- 
troduce shift invariance into the framework of wavelet 
transforms. An early approach was to look at the 
extrema only of the wavelet transform modulus A. 
Several authors suggested the introduction of a cost 
function in order to come up with a unique best ba- 
sis ||, H, [7). A cost function that figures prominently 
is the entropy cost function used in |§. Note that the 
best basis of a structure might change if it is embedded 
into a larger structure. Filter oriented approaches have 
been introduced in Q and H]. 

Most approaches cited above use a notion of shift 
invariance, called \i — translation — invariance in Q 
and near shift invariance in [Sj, that is different from 
the standard usage. They take shift invariance to imply 
that the wavelet transform coefficients do not change 
at all if the signal is shifted. However, they extract 
the different delays of translated signals. By contrast, 
we stick with the standard definition of shift invariance 



which means that the operator output is shifted pro- 
portionally to the shift of the input signal. In [[0J , it is 
shown that shift invariance (referred to as shiftability 
in that paper) cannot be achieved by a critically sam- 
pling transform like the wavelet transform because of 
aliasing. 

In this paper, we propose a decomposition algo- 
rithm called the Averaged Wavelet Transform (AWT) 
that incorporates shift invariance in the standard sense. 
It delivers a unique representation of a signal without 
requiring a cost function. 

Systems using wavelets for the purpose of classifi- 
cation and retrieval have been described in 10, nq, O] . 



2. THE AVERAGED WAVELET 
TRANSFORM 

In one dimension, we define the AWT to be a mapping 
from a discrete signal SIG of length n to k + 1 vec- 
tors of the same length n, where k is the number of 
decomposition levels which is logarithmic in n. That 
is, 

AWT : R n -> R n x R n x . . . (k + 1 times) 

The key idea for obtaining a shift invariant signal 
decomposition is to average the projections of the signal 
onto the wavelet subspace at each scale over all circular 
shifts of the input signal. We call the result the scale 
spectra of the signal. Note that those projections are 
sometimes also referred to as detail vectors, or recon- 
structed details, or just detaife]^] 

More precisely, let WT be a discrete wavelet trans- 
form, let CSi be the circular right shift operator (per- 
forming i right shifts), with CS^ 1 analogously being 
the circular left shift operator, and let REC S be the 
reconstruction operator that projects the signal onto 



1 In Matlab, the details are reconstructed by calling the func- 
tion wrcoef (). It is the inverse of the wavelet transform. 



the wavelet subspace at scale s,s — 0..k, with the spe- 
cial case So being the DC component.^ 
Then the AWT at scale s is 

1 n—1 

AWT S = -J2 CSr 1 (REC s {WT(CS l {SIG))j) 

i=0 

The left shift operator CS"" 1 is applied to compensate 
for the shifting of the signal, i.e. we average over the 
projections that are aligned to the position of the signal 
before shifting. The complete AWT is the set of all 
AWT s ,s = 0..k. 

The shift invariance of this approach follows directly 
from the nature of the averaging process: if the input 
signal is shifted, the AWT is still computed over the 
same set of n circularly shifted signals. Also, as an 
inherent property of the averaging operation, the AWT 
is linear. Like the standard WT, the AWT decomposes 
a signal into a DC component and scale spectra of zero 
mean, with some frequency overlap between the scale 
spectra. Adding a constant to all signal values only 
changes the DC component, but not the scale spectra. 

The AWT is information preserving. The original 
signal can be easily reconstructed by just adding up 
the scale spectra, i.e. 

k 

SIG = AWT * 

s=0 

In this sense, the summation is the inverse transform 
of the AWT. Note that the AWT is highly redundant. 
It increases the space requirement by a factor of k + 1 . 
However, for most applications, the scale spectra will 
serve only as an intermediate representation. Typically, 
it suffices to keep only a few features which are derived 
from the scale spectra. 

The generalization from 1-d to 2-d is conceptually 
straightforward, albeit computationally expensive. The 
circular shifting has to be done over all shifts along the 
two axes, and the 1-d wavelet transform is replaced by 
a 2-d WT. Fortunately, an efficient algorithm to com- 
pute the wavelet coefficients of all n circular shifts of 
a vector of length n in 0(n lg n) time, rather than the 
0(n 2 ) time of the naive approach, is given in §, based 
on the observation that the shift operation is sparse. 
describes the extension of Beylkin's algorithm to 2-d. 

3. EMULATION OF THE AWT BY A 
FILTER BANK 

As pointed out above, the AWT is a shift invariant 
linear transform. As such, the AWT is well defined 

2 In Matlab, the DC component is referred to as reconstructed 
approximation coefficients. 



by a single impulse response for each scale spectrum. 
This means that we have to compute the underlying 
set of AWT filters only once, and then the AWT can 
be computed simply and efficiently by convolution of 
the signal with those AWT filters. 

The filter bank that emulates an AWT depends on 
the type of wavelet chosen for the wavelet transform 
and on the size of the signal - each signal size has its 
own corresponding AWT filter bank. The AWT filters 
are symmetric and therefore linear phase. By construc- 
tion, their length is bounded by the signal length, so 
they are FIR filters. 




Figure 1 : AWT filters (Haar) , 1-d and 2-d, consecutive 
scales, signal length is a power of two 




Figure 2: AWT filters (Daubg), 1-d and 2-d, consecu- 
tive scales, signal length is a power of two 

The simplest AWT filter bank is obtained for the 
Haar wavelet with a signal length that is a power of 
two. They turn out to be dilations of the hat function, 
shown in fig. [l] for the first three Td scales and for two 
scales in the 2-d case. In the 1-d case, the filter co- 
efficients are (-0.25 0.5 -0.25), (-0.0625 -0.125 
0.0625 0.25 0.0625 -0.125 -0.0625) for scale one 



and two, respectively. The AWT filters y^\x) at scale i 
are easily computed from the filter at scale 1 by the 
formula y^ix) = 2- ( - t ~ 1 ^y { - 1 \2~^-^x) which nicely 
reflects their wavelet transform origin. Although these 
AWT filters are the solution of a dilation equation, they 
are not wavelets Jl2| : they are orthogonal to only some 
of their translates and dilations, j This comes as no 
surprise, since it was shown in jll that shiftability can 
only be achieved by relaxing the orthogonality prop- 
erty of the wavelet transform. Clearly, the AWT is 
not a pure wavelet transform, and it doesn't provide 
for a multiresolution ||. Note also that the formula 
for AWT filters for Haar wavelets does not generalize 
to signal sizes that are not a power of two or to other 
types of wavelets. Fig. || shows some of the AWT filters 
derived from Daubechies-8 wavelets. 

4. EXPERIMENTS 

Figure ^ demonstrates the decomposition of a 1-d sig- 
nal (it happens to be a cross section of the image in 
fig. H) by the AWT, and the shift invariance of the lat- 
ter: all scale spectra of the circularly shifted signal are 
also circularly shifted by the same amount. We have 
chosen Haar wavelets here, but the AWT works for any 
wavelet. While a standard Haar wavelet transform re- 
sults in projections that are step functions, the AWT 
gives a much smoother decomposition. 

Figure |4| demonstrates the usefulness of the AWT 
for the retrieval of image substructures at various scales. 
The signal now consists of a small substructure of the 
signal in fig. with the rest of the signal being set to 
zero. Again, the AWT decomposition of this signal is 
given at the top, and the AWT of the same substruc- 
ture, circularly right shifted, at the bottom. 

It can be observed that the AWT of the substruc- 
ture matches the corresponding part of the AWT of the 
full signal to a large degree, depending on the scale. At 
small scales, the filter support is small and therefore 
localized. By contrast, at large scales, the area of sup- 
port is large, which means that the filter response to a 
small substructure is influenced to a large degree by the 
surrounding signal, i.e. adjacent structures. It is also 
evident that the substructure cut-off may introduce an 
edge artifact. 

Figure || gives an example of applying the 2-d AWT, 
based on the Haar wavelet, to an image. The six scale 
spectra plus the DC component add up to the original 
signal. 





Figure 3: (top) AWT of a signal, (bottom) AWT of 
the same signal, but circularly right shifted by 20 units. 





3 ]l2[] says on p. 619 "It [the wavelet W(x) from the hat fct.] 
is not orthogonal to W(x+1)." What was meant was that the 
scaling function is not orthogonal to its translates [personal com- 
munication with the author] . 



Figure 4: (top) AWT of a substructure of the signal 
in fig. ||, (bottom) AWT of the shifted substructure. 
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Signal DC component 




Figure 5: 2-d AWT (Haar, 128x128). Original Image, 
DC component, six dyadic scales. 



